Overview

Dataset statistics

Number of variables10
Number of observations214
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory16.8 KiB
Average record size in memory80.3 B

Variable types

Numeric10

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates
RI is highly correlated with Si and 1 other fieldsHigh correlation
Na is highly correlated with KHigh correlation
Mg is highly correlated with Al and 1 other fieldsHigh correlation
Al is highly correlated with Mg and 1 other fieldsHigh correlation
Si is highly correlated with RIHigh correlation
K is highly correlated with NaHigh correlation
Ca is highly correlated with RIHigh correlation
Ba is highly correlated with TypeHigh correlation
Type is highly correlated with Mg and 2 other fieldsHigh correlation
RI is highly correlated with Si and 1 other fieldsHigh correlation
Na is highly correlated with TypeHigh correlation
Mg is highly correlated with TypeHigh correlation
Al is highly correlated with TypeHigh correlation
Si is highly correlated with RIHigh correlation
Ca is highly correlated with RIHigh correlation
Ba is highly correlated with TypeHigh correlation
Type is highly correlated with Na and 3 other fieldsHigh correlation
RI is highly correlated with CaHigh correlation
Ca is highly correlated with RIHigh correlation
RI is highly correlated with Na and 5 other fieldsHigh correlation
Na is highly correlated with RI and 5 other fieldsHigh correlation
Mg is highly correlated with Na and 4 other fieldsHigh correlation
Al is highly correlated with RI and 7 other fieldsHigh correlation
Si is highly correlated with RI and 5 other fieldsHigh correlation
K is highly correlated with RI and 3 other fieldsHigh correlation
Ca is highly correlated with RI and 7 other fieldsHigh correlation
Ba is highly correlated with RI and 3 other fieldsHigh correlation
Type is highly correlated with Na and 4 other fieldsHigh correlation
Mg has 42 (19.6%) zeros Zeros
K has 30 (14.0%) zeros Zeros
Ba has 176 (82.2%) zeros Zeros
Fe has 144 (67.3%) zeros Zeros

Reproduction

Analysis started2022-01-28 01:47:11.260919
Analysis finished2022-01-28 01:47:57.874805
Duration46.61 seconds
Software versionpandas-profiling v3.1.0
Download configurationconfig.json

Variables

RI
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct178
Distinct (%)83.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.518365421
Minimum1.51115
Maximum1.53393
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:47:58.161223image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum1.51115
5-th percentile1.515401
Q11.5165225
median1.51768
Q31.5191575
95-th percentile1.523664
Maximum1.53393
Range0.02278
Interquartile range (IQR)0.002635

Descriptive statistics

Standard deviation0.003036863739
Coefficient of variation (CV)0.002000087527
Kurtosis4.931737386
Mean1.518365421
Median Absolute Deviation (MAD)0.001265
Skewness1.625430506
Sum324.9302
Variance9.222541372 × 10-6
MonotonicityNot monotonic
2022-01-28T03:47:58.498034image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.51593
 
1.4%
1.516453
 
1.4%
1.521523
 
1.4%
1.516462
 
0.9%
1.518292
 
0.9%
1.522132
 
0.9%
1.515962
 
0.9%
1.517112
 
0.9%
1.517692
 
0.9%
1.516182
 
0.9%
Other values (168)191
89.3%
ValueCountFrequency (%)
1.511151
0.5%
1.511311
0.5%
1.512151
0.5%
1.512991
0.5%
1.513161
0.5%
1.513211
0.5%
1.514091
0.5%
1.515081
0.5%
1.515142
0.9%
1.515311
0.5%
ValueCountFrequency (%)
1.533931
0.5%
1.531251
0.5%
1.527771
0.5%
1.527391
0.5%
1.527251
0.5%
1.526671
0.5%
1.526641
0.5%
1.526141
0.5%
1.524751
0.5%
1.52411
0.5%

Na
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct142
Distinct (%)66.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.40785047
Minimum10.73
Maximum17.38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:47:58.816046image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum10.73
5-th percentile12.415
Q112.9075
median13.3
Q313.825
95-th percentile14.8535
Maximum17.38
Range6.65
Interquartile range (IQR)0.9175

Descriptive statistics

Standard deviation0.8166035557
Coefficient of variation (CV)0.06090488238
Kurtosis3.052232409
Mean13.40785047
Median Absolute Deviation (MAD)0.435
Skewness0.4541814537
Sum2869.28
Variance0.6668413672
MonotonicityNot monotonic
2022-01-28T03:47:59.086973image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13.215
 
2.3%
13.025
 
2.3%
135
 
2.3%
13.644
 
1.9%
13.334
 
1.9%
12.864
 
1.9%
13.244
 
1.9%
12.854
 
1.9%
12.873
 
1.4%
13.413
 
1.4%
Other values (132)173
80.8%
ValueCountFrequency (%)
10.731
0.5%
11.021
0.5%
11.031
0.5%
11.231
0.5%
11.451
0.5%
11.561
0.5%
11.951
0.5%
12.161
0.5%
12.21
0.5%
12.31
0.5%
ValueCountFrequency (%)
17.381
0.5%
15.791
0.5%
15.151
0.5%
15.011
0.5%
14.991
0.5%
14.952
0.9%
14.941
0.5%
14.921
0.5%
14.862
0.9%
14.852
0.9%

Mg
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct94
Distinct (%)43.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.68453271
Minimum0
Maximum4.49
Zeros42
Zeros (%)19.6%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:47:59.394826image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.115
median3.48
Q33.6
95-th percentile3.85
Maximum4.49
Range4.49
Interquartile range (IQR)1.485

Descriptive statistics

Standard deviation1.442407845
Coefficient of variation (CV)0.5373031364
Kurtosis-0.4103189629
Mean2.68453271
Median Absolute Deviation (MAD)0.205
Skewness-1.152559318
Sum574.49
Variance2.080540391
MonotonicityNot monotonic
2022-01-28T03:47:59.641272image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
042
 
19.6%
3.588
 
3.7%
3.548
 
3.7%
3.488
 
3.7%
3.527
 
3.3%
3.625
 
2.3%
3.664
 
1.9%
3.614
 
1.9%
3.54
 
1.9%
3.574
 
1.9%
Other values (84)120
56.1%
ValueCountFrequency (%)
042
19.6%
0.331
 
0.5%
0.781
 
0.5%
1.011
 
0.5%
1.351
 
0.5%
1.611
 
0.5%
1.711
 
0.5%
1.741
 
0.5%
1.781
 
0.5%
1.831
 
0.5%
ValueCountFrequency (%)
4.491
 
0.5%
3.981
 
0.5%
3.971
 
0.5%
3.931
 
0.5%
3.93
1.4%
3.891
 
0.5%
3.871
 
0.5%
3.861
 
0.5%
3.852
0.9%
3.841
 
0.5%

Al
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct118
Distinct (%)55.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.444906542
Minimum0.29
Maximum3.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:47:59.948373image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0.29
5-th percentile0.696
Q11.19
median1.36
Q31.63
95-th percentile2.394
Maximum3.5
Range3.21
Interquartile range (IQR)0.44

Descriptive statistics

Standard deviation0.4992696456
Coefficient of variation (CV)0.3455376739
Kurtosis2.060568969
Mean1.444906542
Median Absolute Deviation (MAD)0.21
Skewness0.907289809
Sum309.21
Variance0.249270179
MonotonicityNot monotonic
2022-01-28T03:48:00.299714image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.548
 
3.7%
1.196
 
2.8%
1.295
 
2.3%
1.435
 
2.3%
1.565
 
2.3%
1.235
 
2.3%
1.354
 
1.9%
1.284
 
1.9%
1.364
 
1.9%
1.523
 
1.4%
Other values (108)165
77.1%
ValueCountFrequency (%)
0.291
0.5%
0.341
0.5%
0.472
0.9%
0.511
0.5%
0.562
0.9%
0.581
0.5%
0.651
0.5%
0.661
0.5%
0.671
0.5%
0.711
0.5%
ValueCountFrequency (%)
3.51
0.5%
3.041
0.5%
3.021
0.5%
2.881
0.5%
2.791
0.5%
2.741
0.5%
2.681
0.5%
2.661
0.5%
2.541
0.5%
2.511
0.5%

Si
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct133
Distinct (%)62.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean72.65093458
Minimum69.81
Maximum75.41
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:00.687020image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum69.81
5-th percentile71.315
Q172.28
median72.79
Q373.0875
95-th percentile73.5175
Maximum75.41
Range5.6
Interquartile range (IQR)0.8075

Descriptive statistics

Standard deviation0.7745457948
Coefficient of variation (CV)0.0106611952
Kurtosis2.967902956
Mean72.65093458
Median Absolute Deviation (MAD)0.385
Skewness-0.7304472251
Sum15547.3
Variance0.5999211882
MonotonicityNot monotonic
2022-01-28T03:48:00.930178image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
72.994
 
1.9%
73.14
 
1.9%
73.114
 
1.9%
73.284
 
1.9%
72.864
 
1.9%
71.993
 
1.4%
72.673
 
1.4%
73.213
 
1.4%
73.083
 
1.4%
72.953
 
1.4%
Other values (123)179
83.6%
ValueCountFrequency (%)
69.811
0.5%
69.891
0.5%
70.161
0.5%
70.261
0.5%
70.431
0.5%
70.481
0.5%
70.571
0.5%
70.71
0.5%
71.151
0.5%
71.241
0.5%
ValueCountFrequency (%)
75.411
0.5%
75.181
0.5%
74.551
0.5%
74.451
0.5%
73.881
0.5%
73.811
0.5%
73.751
0.5%
73.721
0.5%
73.71
0.5%
73.611
0.5%

K
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct65
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.4970560748
Minimum0
Maximum6.21
Zeros30
Zeros (%)14.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:01.189931image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10.1225
median0.555
Q30.61
95-th percentile0.76
Maximum6.21
Range6.21
Interquartile range (IQR)0.4875

Descriptive statistics

Standard deviation0.6521918456
Coefficient of variation (CV)1.312109194
Kurtosis54.68969853
Mean0.4970560748
Median Absolute Deviation (MAD)0.115
Skewness6.55164831
Sum106.37
Variance0.4253542034
MonotonicityNot monotonic
2022-01-28T03:48:01.428787image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
030
 
14.0%
0.5712
 
5.6%
0.5611
 
5.1%
0.611
 
5.1%
0.5810
 
4.7%
0.618
 
3.7%
0.648
 
3.7%
0.597
 
3.3%
0.556
 
2.8%
0.626
 
2.8%
Other values (55)105
49.1%
ValueCountFrequency (%)
030
14.0%
0.021
 
0.5%
0.031
 
0.5%
0.042
 
0.9%
0.051
 
0.5%
0.064
 
1.9%
0.071
 
0.5%
0.084
 
1.9%
0.092
 
0.9%
0.11
 
0.5%
ValueCountFrequency (%)
6.212
0.9%
2.71
0.5%
1.761
0.5%
1.681
0.5%
1.461
0.5%
1.411
0.5%
1.11
0.5%
0.971
0.5%
0.811
0.5%
0.762
0.9%

Ca
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct143
Distinct (%)66.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.956962617
Minimum5.43
Maximum16.19
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:01.739657image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum5.43
5-th percentile7.8125
Q18.24
median8.6
Q39.1725
95-th percentile11.5615
Maximum16.19
Range10.76
Interquartile range (IQR)0.9325

Descriptive statistics

Standard deviation1.423153487
Coefficient of variation (CV)0.1588879566
Kurtosis6.681977951
Mean8.956962617
Median Absolute Deviation (MAD)0.445
Skewness2.047053913
Sum1916.79
Variance2.025365848
MonotonicityNot monotonic
2022-01-28T03:48:02.078992image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8.035
 
2.3%
8.435
 
2.3%
9.574
 
1.9%
8.444
 
1.9%
8.794
 
1.9%
8.213
 
1.4%
8.393
 
1.4%
8.553
 
1.4%
8.523
 
1.4%
8.763
 
1.4%
Other values (133)177
82.7%
ValueCountFrequency (%)
5.431
0.5%
5.791
0.5%
5.871
0.5%
6.471
0.5%
6.651
0.5%
6.931
0.5%
6.961
0.5%
7.081
0.5%
7.361
0.5%
7.591
0.5%
ValueCountFrequency (%)
16.191
0.5%
14.961
0.5%
14.681
0.5%
14.41
0.5%
13.441
0.5%
13.31
0.5%
13.241
0.5%
12.51
0.5%
12.241
0.5%
11.641
0.5%

Ba
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct34
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.175046729
Minimum0
Maximum3.15
Zeros176
Zeros (%)82.2%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:02.338753image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1.57
Maximum3.15
Range3.15
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4972192606
Coefficient of variation (CV)2.840494441
Kurtosis12.54108358
Mean0.175046729
Median Absolute Deviation (MAD)0
Skewness3.416424569
Sum37.46
Variance0.2472269931
MonotonicityNot monotonic
2022-01-28T03:48:02.582197image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
0176
82.2%
0.092
 
0.9%
1.592
 
0.9%
0.112
 
0.9%
0.642
 
0.9%
1.572
 
0.9%
1.641
 
0.5%
1.061
 
0.5%
0.811
 
0.5%
1.191
 
0.5%
Other values (24)24
 
11.2%
ValueCountFrequency (%)
0176
82.2%
0.061
 
0.5%
0.092
 
0.9%
0.112
 
0.9%
0.141
 
0.5%
0.151
 
0.5%
0.241
 
0.5%
0.271
 
0.5%
0.41
 
0.5%
0.531
 
0.5%
ValueCountFrequency (%)
3.151
0.5%
2.881
0.5%
2.21
0.5%
1.711
0.5%
1.681
0.5%
1.671
0.5%
1.641
0.5%
1.631
0.5%
1.592
0.9%
1.572
0.9%

Fe
Real number (ℝ≥0)

ZEROS

Distinct32
Distinct (%)15.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05700934579
Minimum0
Maximum0.51
Zeros144
Zeros (%)67.3%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:02.816160image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.1
95-th percentile0.267
Maximum0.51
Range0.51
Interquartile range (IQR)0.1

Descriptive statistics

Standard deviation0.09743870064
Coefficient of variation (CV)1.709170651
Kurtosis2.662015617
Mean0.05700934579
Median Absolute Deviation (MAD)0
Skewness1.75432747
Sum12.2
Variance0.009494300382
MonotonicityNot monotonic
2022-01-28T03:48:03.062001image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
0144
67.3%
0.247
 
3.3%
0.177
 
3.3%
0.096
 
2.8%
0.15
 
2.3%
0.114
 
1.9%
0.283
 
1.4%
0.073
 
1.4%
0.143
 
1.4%
0.223
 
1.4%
Other values (22)29
 
13.6%
ValueCountFrequency (%)
0144
67.3%
0.011
 
0.5%
0.031
 
0.5%
0.051
 
0.5%
0.061
 
0.5%
0.073
 
1.4%
0.082
 
0.9%
0.096
 
2.8%
0.15
 
2.3%
0.114
 
1.9%
ValueCountFrequency (%)
0.511
 
0.5%
0.371
 
0.5%
0.351
 
0.5%
0.341
 
0.5%
0.321
 
0.5%
0.311
 
0.5%
0.31
 
0.5%
0.291
 
0.5%
0.283
1.4%
0.261
 
0.5%

Type
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct6
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.780373832
Minimum1
Maximum7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.7 KiB
2022-01-28T03:48:03.368899image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile7
Maximum7
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.103738646
Coefficient of variation (CV)0.7566387736
Kurtosis-0.2795182977
Mean2.780373832
Median Absolute Deviation (MAD)1
Skewness1.114915201
Sum595
Variance4.425716292
MonotonicityIncreasing
2022-01-28T03:48:03.588340image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
276
35.5%
170
32.7%
729
 
13.6%
317
 
7.9%
513
 
6.1%
69
 
4.2%
ValueCountFrequency (%)
170
32.7%
276
35.5%
317
 
7.9%
513
 
6.1%
69
 
4.2%
729
 
13.6%
ValueCountFrequency (%)
729
 
13.6%
69
 
4.2%
513
 
6.1%
317
 
7.9%
276
35.5%
170
32.7%

Interactions

2022-01-28T03:47:54.431325image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:14.358335image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:18.379081image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:20.947364image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:24.950008image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:30.771012image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:34.831816image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:40.386647image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:44.747791image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:49.937378image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:54.768025image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:14.978293image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:18.583145image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:21.189427image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:25.532882image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:31.357093image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:35.368152image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:40.697387image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:45.358700image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:50.331268image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:54.994439image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:16.284412image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:18.810879image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:21.638674image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:26.173122image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:31.842046image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:36.010970image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:41.361171image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:45.709096image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:50.706154image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:55.277911image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:16.599941image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:19.064931image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:22.065204image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:26.721911image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:32.329977image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:36.276407image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:41.727894image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:46.381962image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:51.049946image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:55.490028image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:16.827900image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:19.293368image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:22.643812image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:27.063028image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:32.590199image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:36.587099image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:41.994211image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:47.444161image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:51.344500image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:55.702027image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:17.060466image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:19.605011image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:23.087200image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:27.634104image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:32.888162image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:36.978254image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:42.318993image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:47.743036image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:51.782410image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:55.916090image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:17.476270image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:19.915031image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:23.325692image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:28.489789image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:33.400400image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:37.878105image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:42.839096image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:48.189038image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:52.324033image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:56.136609image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:17.669055image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:20.196120image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:23.867823image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:29.291077image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:33.663188image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:38.357011image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:43.379078image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:48.612071image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:53.103846image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:56.355798image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:17.927328image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:20.424932image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:24.299119image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:29.685161image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:33.995984image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:39.012939image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:43.803029image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:49.064323image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:53.663778image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:56.567113image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:18.146493image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:20.699742image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:24.618336image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:30.230906image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:34.370930image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:39.501265image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:44.304982image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:49.497420image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
2022-01-28T03:47:54.101678image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Correlations

2022-01-28T03:48:03.831655image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2022-01-28T03:48:04.185222image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2022-01-28T03:48:04.606074image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2022-01-28T03:48:04.961851image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2022-01-28T03:47:57.087043image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
A simple visualization of nullity by column.
2022-01-28T03:47:57.563082image/svg+xmlMatplotlib v3.3.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

RINaMgAlSiKCaBaFeType
01.5210113.644.491.1071.780.068.750.00.001
11.5176113.893.601.3672.730.487.830.00.001
21.5161813.533.551.5472.990.397.780.00.001
31.5176613.213.691.2972.610.578.220.00.001
41.5174213.273.621.2473.080.558.070.00.001
51.5159612.793.611.6272.970.648.070.00.261
61.5174313.303.601.1473.090.588.170.00.001
71.5175613.153.611.0573.240.578.240.00.001
81.5191814.043.581.3772.080.568.300.00.001
91.5175513.003.601.3672.990.578.400.00.111

Last rows

RINaMgAlSiKCaBaFeType
2041.5161714.950.02.2773.300.008.710.670.07
2051.5173214.950.01.8072.990.008.611.550.07
2061.5164514.940.01.8773.110.008.671.380.07
2071.5183114.390.01.8272.861.416.472.880.07
2081.5164014.370.02.7472.850.009.450.540.07
2091.5162314.140.02.8872.610.089.181.060.07
2101.5168514.920.01.9973.060.008.401.590.07
2111.5206514.360.02.0273.420.008.441.640.07
2121.5165114.380.01.9473.610.008.481.570.07
2131.5171114.230.02.0873.360.008.621.670.07

Duplicate rows

Most frequently occurring

RINaMgAlSiKCaBaFeType# duplicates
01.5221314.213.820.4771.770.119.570.00.012